Constructing Finite State Machines for Fast Gesture Recognition
Abstract
This paper proposes an approach to 2D gesture recognition that models each gesture as a Finite State Machine (FSM) in spatial-temporal space. The model construction works in a semi-automatic way. The structure of the model is first manually decided based on the observation of the spatial topology of the data. The model is refined iteratively between two stages: data segmentation and model training. Given the continuous training data of a single gesture, we roughly segment the gesture trajectory into phrases using the spatial information alone. The segmentation results are used to initialize an FSM. The model is used to re-segment the data. The results of the re-segmentation are used to refine the parameters of the model. After the FSM is trained, we incorporate a modified Knuth-Morris-Pratt algorithm into the FSM recognition procedure to speed up the gesture recognition. The computational efficiency of the FSM recognizers allows real-time on-line performance to be achieved.
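The recognition side of this idea can be illustrated with a minimal sketch. It assumes each FSM state is a circular region in the 2D plane (a centre plus a tolerance radius) and that a gesture is recognized when a stream of samples passes through the states in order. The names `State`, `GestureFSM`, and `recognize`, and the radius threshold, are hypothetical and are not taken from the paper. The paper's modified Knuth-Morris-Pratt step is not reproduced here; on a mismatch this sketch simply falls back to the start state.

```python
from dataclasses import dataclass
from math import hypot


@dataclass
class State:
    """One FSM state: a circular region in 2D (assumed shape, for illustration)."""
    cx: float
    cy: float
    radius: float  # hypothetical spatial tolerance

    def contains(self, x: float, y: float) -> bool:
        return hypot(x - self.cx, y - self.cy) <= self.radius


class GestureFSM:
    """Ordered sequence of states; a gesture is a pass through all of them."""

    def __init__(self, states):
        self.states = states
        self.index = -1  # -1 means the gesture has not started

    def step(self, x: float, y: float) -> bool:
        """Consume one 2D sample; return True when the final state is reached."""
        nxt = self.index + 1
        if nxt < len(self.states) and self.states[nxt].contains(x, y):
            self.index = nxt  # advance to the next state
        elif self.index >= 0 and self.states[self.index].contains(x, y):
            pass  # remain in the current state
        else:
            # Naive fallback: restart from scratch (no KMP-style shortcut).
            self.index = 0 if self.states[0].contains(x, y) else -1
        if self.index == len(self.states) - 1:
            self.index = -1  # reset so the next gesture can be detected
            return True
        return False


def recognize(fsm: GestureFSM, samples):
    """Return the sample indices at which the gesture was completed."""
    return [i for i, (x, y) in enumerate(samples) if fsm.step(x, y)]


if __name__ == "__main__":
    fsm = GestureFSM([State(0, 0, 0.3), State(1, 0, 0.3), State(2, 0, 0.3)])
    stream = [(0, 0), (0.1, 0), (1, 0), (1.1, 0.1), (2, 0)]
    print(recognize(fsm, stream))
```

Each sample costs at most a few distance checks, which is the kind of per-sample cost that makes on-line recognition practical. In a trained model the state centres and tolerances would come from the segmentation and training loop described in the abstract rather than being set by hand.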
Similar resources
How Finite State Machines Can Be Used to Build Error Free Multimodal Interaction Systems
Recognition-based interaction technologies (e.g. speech and gesture recognition) are still error-prone. It has been shown that, in multimodal architectures, combining complementary input modes can contribute to automatic recovery from recognition errors. However, the degree to which error recovery can be achieved is dependent on the design of the interaction, i.e. on the set of multimodal const...
Gesture Modeling and Recognition Using Finite State Machines
This paper proposes a state based approach to gesture learning and recognition. Using spatial clustering and temporal alignment, each gesture is defined to be an ordered sequence of states in spatial-temporal space. The 2D image positions of the centers of the head and both hands of the user are used as features; these are located by a color based tracking method. From training data of a given ...
Virtual Document Projector Camera
This paper describes techniques for the design of a system able to interact with the user by visual recognition of hand gestures. The system is composed of three modules including tracking, posture classification and gesture recognition. A description of each module is given. In order to increase the robustness and the precision of the tracking, several complementary tracking processes are coup...
A hand gesture recognition technique for human-computer interaction
We propose an approach to recognize trajectory-based dynamic hand gestures in real time for human–computer interaction (HCI). We also introduce a fast learning mechanism that does not require extensive training data to teach gestures to the system. We use a six-degrees-of-freedom position tracker to collect trajectory data and represent gestures as an ordered sequence of directional movements i...
A Toolkit for Creating and Testing Multimodal Interface Designs
Designing and implementing applications that can handle multiple recognition-based interaction technologies such as speech and gesture inputs is a difficult task. IMBuilder and MEngine are the two components of a new toolkit for rapidly creating and testing multimodal interface designs. First, an interaction model is specified in the form of a collection of finite state machines, using a simple...